fnTEXT  Systems:  Unique,  Intelligent  Tools  for  the  Internet 

Only  InTEXTs  powerful  Software  Development  Kits  (SDKs)  provide  the  next  generation  value-add  that  today's 
users  expect  from  Internet  and  Web  products.  InTEXTs  unique  technology  provides  the  time-to-market  advantage 
you  need  to  help  knowledge  workers  and  consumers  benefit  most  from  your  product: 

•  Profiling  with  natural  english — easily  create  user  profiles  with  simple  phrases  and  seed  sentences 

•  Self-tuning  agents — employ  learning  agents  to  monitor  new  information  as  it  becomes  available 

•  Summarizing — create  document  abstracts  on  the  fly:  no  indexes,  no  topic  trees,  no  semantic  networks 
•Automatic  HTML  &  hyperlink  generation — quickly  &  easily  publish  documents  to  the  Web 

•  HTML-to-HTML  search  &  retrieval— search  HTML  documents  &  retrieve  them  with  all  links  intact 

•  Natural  language  querying — find  crucial  information:  no  complicated  query  structures  to  leam 

•  Relevancy  profiling  &  retrieval  based  on  content—know  what  information  most  applies,  instantly 

•  Multilingual  engines — ^French,  German,  Spanish,  Chinese  and  more... 

•  Multi-platform  support — ^Windows  &  major  UNK  platforms 

•  Scalable  architecture — SDKs  deploy  able  with  browsers,  servers  &  on-line  services 

InTEXT  Systems'  Intelligent  Internet  Development  Tools  are  based  on  a  powerful  Heuristic/Learning  architec- 
ture, which  uses  skim-reading,  comprehension  and  self-tuning  techniques  to  understand  the  content  of  informa- 
tion. Where  other  technologies  force  developers  to  perform  extensive  indexing  and  create  tremendous  overhead, 
InTEXTs  technology  provides  real-time  performance  with  little  or  no  indexing  and  minimal  overhead: 

InTEXT  Object  Roufer  Agent  SDK  InTEXT  Retrieval  Engine,  Webserver  SDKs 


Automatically  monitors  live  data,  including  news 
groups  and  wire  feeds,  and  channels  it  directly  to 
users'  desktops  without  requiring  indexing. 

•  On-the-fly  information  profiling 

•  Natural  language  filter  creation 

•  Dynamic,  self-tuning  filters 

•  Scalable,  client/server  toolkit 

InTEXT  Object  Analyzer  Agent  SDK 

Creates  document  abstracts  on  the  fly,  reducing 
network  overhead  and  saving  users  from  information 
overload. 

•  On-the-fiy  document  summarizer 

•  Automatic  keyword  generator 

•  Internal  hypertext  linking 

•  Powerful  application  development  base 

InTEXT  Precision  Agent  SDK 

Reduces  full-text  indexes  tenfold,  saving  disk  space 
and  increasing  both  speed  &  accuracy  significantly. 

•  Dramatically  reduced  server  indexes 

•  Rapid  retrieval  and  unparalleled  accuracy 

•  Powerful  platform  for  application  development 


Delivers  instant  access  to  the  world  of  on-line  infor- 
mation by  supporting  scalability,  extensibility  and 
compatibility. 

•  Powerful  search,  retrieval  and  storage  APIs 

—  WAIS,  Z39.50  and  HTML  compliance 

—  Document  mgmt,  RDBMS  &  4GL  extensibility 

•  Dynamic  relevancy  ranking 

•  Several  search  facility  options 

—  Full  boolean,  fuzzy,  phrase,  proximity,  free  form 

english  searches  and  many  others... 

•  Native  document  display  and  output,  inch  HTML 

•  Multiple  index  and  security  levels 

•  Multilingual,  inch  European  &  Asian  languages 

•  Windows,  DOS,  UNIX  and  Mainframe  support 

•  Client/Server  architecture  supporting  TCP/IP  and 
SNA  LU6.2  connectivity 

InTEXT  NLQ  SDK 

Lets  all  users  access  the  information  they  need  using 
plain  english  and  seed  sentence  queries. 

•  Minimal  user  learning  curve 

—  Access  information  with  "normal"  language 

•  Promote  on-line  searches 

•  Intuitive  for  content-based  searches 

•  Integrates  with  3rd  party  search  engines 


Call  today  for  a  free  white  paper:  Tools  for  the  Internet:  Channeling  the  Information  Tidal  Wave 


InTEXT 

CP  Software  Gnsup 


Worldwide  Headquarters,  120  Montgomery  Street,  Suite  450,  San  Francisco,  CA  94104 
telephone: +^  415/391-5290,  facsimile: +1  415/391-4996,  /jome  page;  HTrP://www.intext.com 
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/a/TEXTRetrieval  Engine:  Web  Search  and  Retrieval 
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The  InTEXT  RETRffiVAL  Engine 
is  a  content-based  document 
Search  and  Retrieval  Engine 
FOR  organizational  textbases, 
the  Internet  and  the  World 
Wide  Web.  It  supports  UNIX, 
MVS/CICS,  MVS/TSO,  VAX/ 
VMS,  Open  VMS,  Windows 
AND  MS/DOS. 


"The  InTEXT 
Retrieval  Engine  is 
the  most  powerful 
and  diverse  docu- 
ment storage  and 
retrieval  product 
available  in  the 
industry  today. " 


InTEXT 


Internet  and  Web  Search  Engine 

The  /nr^Xr  Retrieval  Engine  is  WAIS  (Wide  Area  Information 
Server),  Z39.50  and  HTML  compliant  and  can  be  used  as  a 
search  engine  for  the  Internet  and  the  Web.  For  example,  develop- 
ers can  use  InTEXT  to  provide  powerful  query  capabilities  to 
Internet  and  Web  textbases  so  users  can  find  relevant  on-line 
information. 

Dynamic  Document  Search!  and  Retrieval 

Free-form  natural  language  querying,  full  Boolean  searching  and 
positional  relationship,  the  /nr^XT  Retrieval  Engine  has  it  all. 
From  Windows,  UNIX  chents  or  character  terminals,  users  can 
formulate  both  structured  and  free-form  English  queries  to 
initiate  on-line  information  searches. 

Dynamic  Relevancy  Ranking 

The/nTfiXTRetrieval  Engine  provides  the  most  sophisticated 
retrieval  and  document  ranking  mechanism  available  today. 
Based  upon  a  content  understanding  architecture,  the  Retrieval 


Engine  automatically  self-tunes  to 
the  document  collection  as  it 


changes. 


It  reflects  alterations  in  the  rel- 
evancy percentages  assigned  to 
documents,  which  are  presented  to 
users  in  new  priority  orders. 

Further,  users  can  assign  individual 
weights  to  their  search  criteria  to 


subjectively  influence  document 
priority  ranking. 

Native  Document  Display  and 
Output 

Once  users  find  what  they  want,  the 
/nTEXr  Retrieval  Engine  provides 
many  ways  to  view  the  results. 
Users  can  look  at  a  list  of  document 
titles,  sort  them,  natively  view  those 


pages  that  contain  a  search  term  or 
output  the  results  to  a  file  or  printer. 

Enterprise-Wide  Solution 

If  your  organization  is  lost  in  a 
document-intensive  world,  the 
/nTEXT  Retrieval  Engine  is  a  truly 
cost-efficient  solution.  Running 
across  the  enterprise,  InTEXT  tarns 
the  mass  of  electronic  information 
into  a  business  advantage. 


Search  Facilities: 

•  Full  Boolean  Logic 

•  Free  Form  Natural  Language 

•  Structured  Data  (e.g.  dates,  names) 

•  Fuzzy  Logic 

•  Phrase 

•  Proximity 

•  Stemming  and  Wildcard 

•  Synonyms 

•  Structured  Thesaurus 

•  User  Weighted  Search  Terms 

•  Unlimited  Search  Scoping 

Display  Facilities: 

•  Automatic  Results  List  Ranking 

•  Search  Term  Highlighting 

•  Document  Display  by  Title,  Search 
Term  in  Context,  Smart  Full-Text, 
Document  Parts 

•  Output  Results  to  Terminal,  File, 
Printer,  Other  Applications 

System  Features  and  Facilities: 

•  Powerful  Macro  Language 

•  User  Customizable  Search  Screens 

•  Inverted  Index,  Built-in  Compression 

•  Dynamic  Real-Time  Indexing 

•  Strong  Security 

•  Batch  Processing 

•  On-Line  Help 

•  Flexible  Textbase  Structure 

•  Client/Server  Architecture 

•  Full-Featured  API 


Features 


Benefits 


Natural  Language  Query 


Dynamic  Index  Management 


Definable  Index  Levels 


Powerful  Application  Development  Base 


World  Wide  Web  and  Internet 
Search  and  Retrieval 


Access  info  with  "normal"  language 
No  new  languages  to  learn 
Minimal  employee  training 
Eliminate  manual  searches 


Easy-to-change  indices 
Dynamic  textbase  update 
Tremendous  timesaver 


Objective  specification 
Subjective  specification 
Customization 
Ease  of  use 


Flexible  development  with  macros 
"Learning"  applications 
API  Library  for  customization 
Dynamic  rather  than  static 
Easy  to  change,  upgrade,  administer 
Client/Server  architecture  provides 
enterprise-wide  scalability 
Supports  popular  textbases  such  as 
STATUS™ 


Supports  WAIS,  Z39.50  and  HTML 
protocols 


InTEXT 

World  Headquarter 

715  Sutter  St 
FolsomCA  95630 
Telephone:  (916)  985-4445 
FflCJimife;  (916)  985-3557 
HTTP://www.intexLcom 

Copyright  ©  1 995  /nTECrSystems,  Inc.InTEXT,  Webserver  and  Heuristic/Learning  are  trademarks  of/nr£Xr  Systems.  All  other  names  and  products  are  trademarks  of  their  respective  owners. 
M7006-C  06/95. 


United  States 

120  Montgomery  St 
Suite  450 

San  Francisco  CA  94104 
re/ep/ione;  (415)  391-5290 
FflCiimi/e;  (415)  391-4996 


Australia/New  Zealand 

Cnr  Geils  Circuit  and  Denison  Sts 
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Australia 
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United  Kingdom 
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/yvTEXT  Enterprise  Manager:  Distributed  Doc  Mgmt 


t,,,i 


,1,  J,-M/  /^/- 


/>  * 


^ 


Author  Station 

ADD,  REVISE,  DELETE, 

FIX,  MIGRATE 


Retrieval  Station 

SEARCH,  DISPLAY, 

PRINT 


Enterprise  DocumenP 
Repository 


CICS 

Enterprise 
Manager 


DB2 

Control 

Table 


InTEXT 

Webserver 


fT^ 


3270 
Display 


Local  Document 
Repository 


OS/2  Enterprise 
Manager 


Database 

Manager 

Control  Table 


The  InTEXT  Enterprise 
Manager  provides  document 
storage,  management,  and 

retrieval  ACROSS  AN  ENTERPRISE 
ENVIRONMENT  INCLUDING  MAIN- 
FRAME, Windows,  OS/2,  UNIX 

AND  CHARACTER  WORKSTATIONS. 


"InTEXT  Enterprise 
Manager  provides 
the  only  desktop- 
LAN-mainframe 
based  distributed 
document  manage- 
ment solution 
available  today. " 


Enterprise-Distributed  Applications 

Building  Windows-based  applications  that  communicate  with  IBM  MVS/ 
CICS  has  never  been  easier.  /nTEXr  Enterprise  Manager  is  the  solution  for 
creating  distributed  document  management  applications.  WiihlnTEXT 
Enterprise  Manager,  users  have  full  access  to  Document  Management 
functions  such  as  full  check-in/check-out  management  and  document 
migration  across  LANs  and  WANs — all  from  their  desktops. 

Document  Management 

Besides  mainframe-to-desktop  communication,  Enterprise  Manager  has  a 
powerful  set  of  application  programming  interfaces  (APIs)  for  managing 
large  volumes  of  documents  and  provides  the  ability  to  add,  update,  delete, 
get,  fix  and  migrate  documents.  Documents  can  reside  on  the  LAN  and  are 
known  to  all  LANs  in  the  WAN.  They  can  also  reside  on  the  mainframe 
and  are  available  to  all  servers. 


InTEXT 


Full-Text  Retrieval 

Enterprise  Manager  is  compatible  with  many  popular  text  retrieval  prod- 
ucts, including  STATUS™.  Documents  that  are  moved  to  the  mainframe 


can  be  added  to  or  deleted  from 
STATUS  textbases.  These  docu- 
ments are  then  available  for  full-text 
retrieval  using  STATUS  either  via 
character  based  terminals  or  GUIs. 

Built-in  Support  for 
Communication  Protocols 

With  Enterprise  Manager,  there  are 
no  more  networking  worries.  All  of 


the  protocols  are  built  in.  Whether 
using  SNA  APPC  or  TCP/IP  at  the 
mainframe  level  or  Token  Ring, 
TCP/IP  or  Named  Pipes  at  the 
server  level,  communication  and 
document  transfer  occur  seamlessly. 

Portability 

The  /nlEZr  Enterprise  Manager 
can  be  used  in  a  number  of  configu- 


rations. Please  discuss  any  special 
requirements  with  your  InTEXT 
Systems  representative. 


Features 


Benefits 


MVS  to  Windows  Tools 


Distributed  application  development 

toolkit  for  enterprise-wide  document 

management 

Leverage  existing  investment  in  legacy 

applications 

Access  all  mainframe  information  assets 


Document  Management 


Complete  check-in/check-out,  add, 

update,  delete,  get,  fix,  migrate 

functionality  for  full-scale  document 

control 

Enterprise-wide,  heterogeneous 

environment  for  seamless  sharing  of 

documents 


Open  Architecture 


Can  be  ported  to  other  platforms  and 
other  LAN  environments 
Supports  multiple  communication 
protocols 

Compatible  with  InTEXT  Webserver 
and  STATUS  for  Internet  and  Web 
document  publishing,  search  and 
retrieval 


InTEXT 
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InTEXT \NebPak:  Client/Server  Development  Environment 
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The  InTEXT  WebPak  is  the 
world's  first  suite  of  prod- 
ucts AND  tools  for  ADDING 

intelligence  to  Web-based 
document  publishing,  profil- 
ing, HTML-TO-HTML  search 

&  RETRIEVAL,  NATURAL  LANGUAGE 
querying  and  SUMMARIZATION 

for  both  the  unix  and 
Windows  environments. 


"The  WebPak 
Client/Server 
Development 
Environment  helps 
organizations 
quickly  create 
intelligent  Web 
sites,  publish  Web 
pages  at  the  click  of 
a  button  and 
efficiently  leverage 
their  information 
assets  in  the  on-line 
world." 


InTEXT 


Intelligent  Web  Site  Deployment 

Combining  a  Windows-based  automated  publishing  environment, 
Web-ready  applications  for  rapid  deployment  and  a  well- versed 
combination  of  Software  Developer's  Kits  (SDKs),  the  InTEXT 
WebPak  provides  Web-site  developers  with  intelligent  tools  for 
quickly  deploying  advanced  Web-based  applications. 

Automated  Web  Publishing— Island  InTEXT 2.0 

Using  the  WebPak,  appUcation  developers  can  build  Web  pub- 
lishing sites  that  support  popular  Web  browsers,  such  as 
Netscape  and  Mosaic,  WAIS  and  Z39.50-compliant  Internet 
clients  and  leading  full-text  retrieval  databases. 

These  Web  site  applications  can  support  several  unique  capabili- 
ties, including  free-form  English  user  queries,  live  document 
routing,  real-time  summarization,  content-based  searching, 
relevancy-ranked  retrieval,  automatic  hyperlink  and  keyword 
generation  and  automatic  Web  creation. 


/n TEXT  Object  Router  SDK 

Included  in  the  WebPak  kit,  the 
ItiTEXT  Object  Router  Intelligent 
Agent  SDK  automatically  monitors 
live  data,  including  news  groups  and 
wire  feeds,  and  channels  it  directly 
to  users'  desktops  without  requiring 
indexing. 

/nTEXr  Object  Analyzer  SDK 

This  document  summarizing  toolkit 
creates  document  abstracts  on  the 
fly,  reducing  network  overload  and 
saving  users  from  information 
overload. 


/nrEXr  Webserver  SDK 

Based  on  the  powerful  InTEXT 
Retrieval  Engine,  the  Webserver 
delivers  instant  access  to  the  world 
of  on-line  information  by  supporting 
scalability,  extensibility  and  compat- 
ibility. The  Webserver  provides 
relevancy-ranked  retrieval,  based  on 
document  content,  and  supports 
HTML,  WAIS  &  Z39.50  protocols. 

/nTEXT  NLQ  SDK 

The  InTEXT  NLQ  Engine  is  the 
most  advanced  free-form  English 
facility  of  its  kind.  It  lets  users 


access  the  information  they  need 
using  plain  english  and  seed  sen- 
tence queries,  minimizing  and  even 
eliminating  syntax  training. 

Intuitive  Architecture 

The  InTEXT  WehPak  products  and 
SDKs  are  based  on  InTEXT  Sys- 
tems' Heuristic/Learning™  architec- 
ture that  analyzes  document  surface 
structures  to  gain  content  knowledge 
of  live  and  stored  information.  This 
unique  architecture  makes  the 
InTEXT  technology  critical  tools  for 
the  highly  interactive  and  changing 
content  of  the  Internet  and  the  Web. 


Features 


Benefits 


Automated  Web  Publishing 


HTML  conversion  layer  and  automatic 
hyperlink  generation  let  anyone  create 
Web  pages 


Profiling  with  Natural  English 


•  Easily  create  user  profiles  with  simple 
phrases  and  seed  sentences 


Self-Tuning  Agents 


Employ  learning  agents  to  monitor  new 
information  as  it  becomes  available 


Summarizing 


Create  document  abstracts  on  the  fly 
and  avoid  building  indexes,  topic  trees 
and  semantic  networks 


Relevancy  Profiling  and  Retrieval 


Users  instantly  know  what  information 
most  applies  to  their  information  needs 


World  Wide  Web  and  Internet 
Search  and  Retrieval 


Supports  WAIS,  Z39.50  and  HTML 
protocols 


Scalable  Architecture 


•  SDKs  deployable  with  browsers,  servers 
and  on-line  services 


InTEXT 
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InTEXT  Object  Router:  Intelligent  Agent  Toolkit 


The  InTEXT  Object  Router 
intelligent  agent  toolkit 
routes  live  and  stored  infor- 
mation to  users  based  on  their 
interest  profiles — without 
requiring  indexing.  object 
Router  is  a  powerful  tool  for 
managing  the  tidal  wave  of 
on-line  information. 


"Object  Router's 
content-based 
approach  stream- 
lines the 
organization's 
business  process  by 
using  intelligent 
agents  to  route 
information  based 
on  a  user's  definition 
of  what's  important." 

-Delphi  Consulting, 
October  1994 


InTEXT 


On-the-Fly  Information  Routing 

With  the  Object  Router  technology,  users  can  engage  intelligent 
agents  to  search  their  organization's  databases  and  on-line  news 
feeds  for  specific  data  and  dynamically  route  it  to  user-defined 
folders  for  easy  access  and  usage. 

Object  Router  allows  users  to  identify,  in  natural  language,  the 
topics  in  which  they  are  interested.  As  an  intelligent  database  and 
network  agent  toolkit.  Object  Router  simply  watches  for  those 
topics  and  brings  them  to  user  desktops. 

Filter  Intelligence 

Object  Router  filters  learn  from  the  stream  of  documents  and 
dynamically  tune  themselves  to  better  discriminate  between 
relevant  and  irrelevant  messages  each  time  they  are  confronted 
with  new  subjects. 

The  filters  become  more  effective  as  they  mature  and  become 
more  sensitive  to  rapidly  changing  issues,  giving  users  faster 
access  to  the  most  critical  information,  automatically. 


Before  being  routed  to  users' 
electronic  mailboxes,  documents  are 
ranked  based  on  the  material's 
relevancy  to  user-defined  subjects. 
Users  have  the  flexibility  to  specify 
how  much  priority  a  specific  subject 
should  have  compared  to  other  filter 
subjects. 

A  single  filter  can  serve  multiple 
recipients,  each  with  a  different 
relevance  threshold.  Some  users 
may  wish  to  see  messages  if  they  are 


central  to  the  topics  covered  in  a 
filter,  while  others  may  adopt  a 
more  general  information  gathering 
approach.  Object  Router  caters  to 
the  various  needs  within 
workgroups. 

On-Line  Profiling  and  Routing 
Solutions 

Object  Router  is  part  of  InTEXT 
Systems'  complete  line  of  content 
management  solutions.  It  can  be 


used  by  corporations  and  institu- 
tions to  create  customized  informa- 
tion profiling  and  management 
solutions  for  their  own  databases 
and  for  the  Internet. 

Open  Architecture 

The  Object  Router  toolkit  is  a  C 
language  subroutine  library  that 
allows  programmatic  access  to 
Object  Router's  key  features.  It  is 
available  for  UNIX  and  Windows. 


Object  Router 

Intelligent 
Content  Agent 


mrnr 


Filters 


i^^ 


1 1     1    -• 

User  A 

UserB 

UserC 

The  Object  Router  Intelligent  Content 
Agent  is  an  electronic  information  sleuth 
that  scans  organizational  and  Internet  data 
and  routes  it  to  appropriate  users. 


Features 


On-the-Fly  Routing 


Filtering/Routing/Classification 


Filter  Intelligence 


Beneflts 


•  No  indexing  required 

•  The  most  appropriate  documeiits 
automatically  route  to  interested  users 

•  Incoming  e-mail  automatically  sorted 
into  topic  folders 


•  More  timely  access  to  current 
information 

•  Access  to  only  the  documents 
people  want 

•  Reduces  information  overload 

•  Reduces  network  traffic 


Filters  that  learn  from  the  stream  of 
information  and  automatically  tune  to 
the  changing  nature  of  information 
Users  receive  leading  edge  information 
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InTEXT  Precision:  Dynamic  Document  Content  Agent 
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File     Edit     Options 


InTEXT  Precision  Demonstrator  -  C;\WiNWORD\INTE>a^BOB.DAT 
View 


Ml 


SECTION! 

1.1  SCOPE     4 

1.2  OVERVIEW  4 

1.3  BACKGROUND     4 

1.4  REFERENCE  DOCUMENTS  6 
SECTION  II 

2.1  SYSTEM  DEFMITIOH  7 
SECTION  III 
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<sect><title  ltvel-"0">  CCMS-II  Checkoul, 

<kp  woris-"l"  lype-"OENERAL  WORD" 
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<kp  OTrdE--l-  lypo-'OENERAL  WORD- 
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InTEXT  Precision  is  an  accel- 
erated INDEXING  AND  DYNAMIC 
CONTENT  AGENT  TOOLKIT  USED 
FOR  SIGNIFICANTLY  REDUCING 
FULL-TEXT  INDEXES  AND  AUTO- 
MATING HTML  &  SGML 
DOCUMENT  PREPARATION  FOR  ON- 
LINE PUBLISfflNG. 


"Precision's  automa- 
tion of  the  document 
preparation  process 
for  on-line  publish- 
ing and  retrieval 
vendors  is  extremely 
financially  and 
commercially  viable. 
What  were  formerly 
labor  intensive 
activities,  such  as 
keyword  generation 
and  HTML  conver- 
sion, are  automatic 
with  Precision. " 


InTEXT 


Intelligent  Keyword  Generation  Facility 

InTEXTPiecision  generates  high-quality  keywords  from  docu- 
ments without  bias.  For  example,  in  an  authoring  system,  an 
application  can  use  Precision  to  automatically  fill  in  the  key- 
words rather  than  forcing  the  user  to  do  so.  Traditionally,  this 
valuable  text  retrieval  technique  involves  a  labor-intensive 
process,  often  frustrating  end  users  and  resulting  in  inaccurate 
search  criteria.  Precision's  automation  of  this  process  makes  it 
much  more  user  friendly  and  commercially  viable. 

Index  Reduction  for  High  Precision  and  Recall 

/nTEXrPrecision  utilizes  a  unique  PreciseScoping™  technology 
that  generates  a  significantly  smaller  full-text  index  than  any 
other  commercial  product.  Precision  does  this  by  automatically 
creating  content-aware,  summarized  documents. 

These  unique,  surmnarized  documents  can  be  used  in  any  high 
precision  full-text  retrieval  application,  enabling  a  significant 
increase  in  precision  while  producing  indexes  that  are  5  to  10 
times  smaller. 


/nraxrPrecision  is  unique  in  the 
way  it  tackles  precision  retrieval  and 
recall  from  the  indexing  end  rather 
than  the  querying  end.  The  summa- 
rized documents,  with  the  most 
content-bearing  words  intact,  are 
indexed  rather  than  the  entire 
document.  Because  of  the  dramati- 
cally smaller  index,  retrieval  speed 
is  improved.  The  content  knowledge 
of  the  summarized  document 
improves  relevancy  ranking  or 
"hits"  of  documents  that  fit  a 
particular  query. 


/nr^Xr  Precision  offers  all  full-text 
retrieval  vendors  and  on-line 
publishers  the  opportunity  to 
dramatically  improve  the  accuracy 
of  their  retrieval  system  without 
sacrificing  recall. 

Automatic  Conversion  to 
Standard  Formats 

While  automatically  converting 
documents  into  standard  interchange 
and  publishing  formats,  InTEXT 
Precision  streamlines  the  creation  of 


logical  structure  identifications  and 
SGML,  HTML  and  keyword  tags. 
Additionally,  Precision  enables 
application  developers  to  automate 
hypertext  linking.  This  markup 
feature  greatly  reduces  the  efforts 
required  in  publishing  to  the  World 
Wide  Web  and  the  Internet. 

Software  Development  Kit 

/nrjEXT  Precision  is  available  as  an 
SDK,  with  C  libraries  and  documen- 
tation, for  UNIX  and  Windows 
operating  systems. 
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Till  lull  \[  l'ii.(.i\iim  iiutiil  U (  hiu>l()i\ 
automatically  creates  content-aware, 
HTML  marked  up,  and  "precise  indexed" 
documents. 


Features 


Benefits 


Content-Aware  Document  Preparation 


Automatic  SGML  and  HTML  Conversion 


Reduction  in  Index  Size 


Automatic  keywording 


Open  Architecture 


Increased  precision  and  recall 
in  document  search  and  retrieval 
Documents  ready  for  web  clustering, 
routing,  crawling  to  get  users  the  most 
up-to-date  information  possible 


Documents  instantly  ready  for  Internet 

and  Web  publishing 

Standardized  for  worldwide  document 

exchange 


•  Storage  overhead  reduction 

•  Faster  search  and  retrieval 

•  Maintenance  reduction 

•  Increase  in  computer  responsiveness 


•  Reduces  staff  tasking  significantly 

•  Increases  keywording  accuracy 

•  Future-proofs  end  user  applications  by 
automatically  filling  in  word  processing/ 
document  authoring  keyword  prompts 


•  Built  on  Heuristic/Learning  architecture 
so  content  knowledge  is  inherent 

•  API  library  for  customization 


InTEXT 
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Object  Analyzer:  Intelligent  Summarizing  Agent 


InTEXT  Object  Analyzer 
provides  real-time  content 
analysis  and  summarization  of 
documents.  object  analyzer 
is  a  powerful  tool  for  access- 
ing the  most  crucial  parts  of 
information,  across  the 
enterprise  and  the  internet. 


"InTEXT's  useful 
summarizing  tool 
gives  users  an 
excellent  way  to  cut 
down  on  the  sheer 
volume  of  docu- 
ments. ...We  found 
the  heuristic 
algorithms  InTEXT 
uses  to  create 
summaries  to  he 
quite  sophisti- 
cated. " 

-  PC  Week,  Oct  '94 


InTEXT 


On-the-Fly  Summarization 

Object  Analyzer  identifies  the  major  information  content  of  large 
documents  and  creates  precise  synopses  of  those  documents. 
Using  InTEXT s  unique  heuristic  technology,  Object  Analyzer 
understands  the  "aboutness"  of  documents  and  deduces  the  most 
important  information  on  the  fly.  No  front-loading  of  dictionaries 
or  index  building  is  required.  Object  Analyzer  automatically 
creates  summaries  from  1-99  percent  of  the  original. 

Document  Assimilation 

Object  Analyzer  enables  users  to  comprehend  complex  and 
lengthy  documents  by  extracting  the  document's  main  focus.  It 
performs  analyses  by  reading  the  entire  document  and  objectively 
rating  every  word,  phrase  and  sentence  by  its  contribution  to  the 
document's  central  theme  or  themes. 

Dynamic  Keyword  indexing 

Object  Analyzer  automatically  creates  a  coherent  index  of  a 
document's  key  words  and  phrases.  Users  can  display,  save  and 


print  this  index,  complete  with  word 
variation,  word  frequency  and 
alphabetical  listing  options. 

Content  Knowledge 
Expansion 

While  Object  Analyzer's  content 
knowledge  is  built-in  and  domain 
independent,  it  can  also  be  tuned  to 
make  more  accurate  and  comprehen- 
sive extractions  from  documents 
associated  with  particular  topics  and 
disciplines.  For  example,  medical 


terms  can  be  added  and  dynamically 
called  upon  during  document 
analysis. 

Automatic  Hypertext 
Preparation 

Object  Analyzer  generates  auto- 
matic hypertext  links  from  the 
keyword  index  to  associated  words 
within  a  summary.  And,  Object 
Analyzer  can  output  abstracts  in 
SGML  format.  It  marks  all  in- 
stances of  key  phrases  and  key 


sentences,  and  it  represents  key 
phrases  grouped  by  their  syntactic 
variants  and  sub-phrase  families. 

Open  Arciiitecture 

For  application  customization,  the 
Object  Analyzer  Software  Develop- 
ment Kit  (SDK)  contains  a  C 
language  subroutine  library  that 
allows  programmatic  access  to  the 
Object  Analyzer's  key  features.  The 
SDK  is  available  for  UNDC  and 
Windows. 


Features 


Benefits 


The  Object  Analyzer  summarizing  agent 
understands  the  content  of  documents  and 
creates  accurate  summaries  on  the  fly.  It 
also  creates  keyword  indexes  automati- 
cally. 


On-the-Fly  Summarization 


Automatic  Keyword  Generation 


Powerful  Application  Development  Base 


•  Objective  interpretation  of  documents 

•  No  time-consuming  preparation 

•  No  dictionaries  or  indexing  required 

•  Instant  access  to  the  most  important 
pieces  of  information 

•  1-99%  summaries  of  the  original 

•  Read  thumbnail  versions  of  Internet  and 
Web  documents 


•  Easy  keyword  lookup  for  fast  research 

•  Documents  already  prepared  for 
keyword  assignments  in  text  retrieval 
engine — tremendous  timesaver 

•  Internal  hypertext  linking  and  SGML 
generation  for  standardization  ease 

•  Can  perform  unique  "next"  and  "last" 
keyword  searches 


Future-proofed  enterprise  investment 
Efficient  Internet  publishing  preparation 
and  utilization 
API  library  for  customization 
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